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TECHNICAL FIELD 

This invention relates to dynamic detection of maximum bandwidth for a 
connection between entities on a TCP network environment. In particular, it 
relates to countermeasures to flow-control functions of the network environment 
that may effectively delay transmission of a set of packets. 

BACKGROUND 

As the Internet has matured, the characteristics of the available content on 
the Internet have changed. Sound and video content is now included with the 
traditional textual content. However, this new content on the Internet requires a 
greater connection speed (i.e., bandwidth) than was commonly available a few 
years ago. 

Fig. 1 illustrates an example of a typical Internet configuration. It includes 
a server (such as media server 20), which is coupled to the Internet 30. The server 
typically includes one or more physical server computers 22 with one or more 
physical storage devices and/or databases 24. On the other side of an Internet 
transmission is a client 90, which is connected via one of many available Internet 
Service Providers (ISPs) 80. Herein, a server is a network entity that sends data 
and a client is a network entity that receives data. 

Cloud 30 is labeled the Internet, but it is understood that this cloud 
represents that portion of the Internet that only includes that which is illustrated 
therein. Inside such cloud are the routers, transmission lines, connections, and 
other communication devices that more-often-than-not successfully transmit data 
between clients and servers. Inside exemplary Internet cloud 30 are routers 32-44; 
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two satellite dishes 46 and 50; and a satellite 48. The links between these devices 
represent the possible paths that a data packet may take on its way between the 
server and the client. 

In general, a communication device on a network (such as the Internet) is 
any device that facilitates communication over the network between two entities, 
and includes the two entities. Examples of such entities include the server 20 and 
the client 90. 

The layers of the OSI Model 

Open System Interconnection (OSI) model is an ISO standard for 
worldwide communications that defines a networking framework for 
implementing protocols in seven layers. Control is passed from one layer to the 
next, starting at the application layer in one station, proceeding to the bottom 
layer, over the channel to the next station and back up the hierarchy. A person of 
ordinary skill in the art is familiar with the OSI model. 

Most of the functionality in the OSI model exists in all communications 
systems, although two or three OSI layers may be incorporated into one. These 
layers are also called "levels." 

Generally, the hardware implements the physical layer. Such hardware 
may include a network card, a modem, or some other communications device. 
Typically, the kernel of an operating system (OS) implements the transport layer. 

The top of the stack is the applications in the application layer. This 
includes any application that communicates with entities outside of the computer, 
such as a Web browser, a media player, and an email program. The application 
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layer has the least control of details of communication between entities on a 
network, such as the Internet. 

Bandwidth 

Bandwidth is the amount of data that can be transmitted in a fixed amount 
of time. For example, bandwidth between media server 20 in Fig. 1 to media 
client 90 is calculated by the amount of data (e.g., 1000 bits) that may be 
transmitted between them in a unit of time (e.g., one second). More specifically, 
data may be transmitted between devices at a rate of approximately 56,000 bits per 
second. That may be called 56 kilo-bits per second (Kbps). 

As shown in Fig. 1, a transmission over the Internet travels across multiple 
links before it reaches its destination. Each link has its own bandwidth. Like a 
chain being only as strong as its weakest link, the maximum bandwidth between 
server 20 and client 90 is the link therebetween with the slowest bandwidth. 
Typically, that is the link between the client 90 and its ISPs 80. That slowest 
bandwidth is the maximum de facto bandwidth. 

Herein, unless otherwise apparent from the context, references to 
bandwidth between network entities (such as server 20 and client 90) is assumed 
to be the maximum de facto bandwidth therebetween. 

Bandwidth may also be called "connection speed", "speed", or "rate". In 
references to bandwidth measured by bits per second, it may also be called "bit 
rate" or "bitrate." 
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Streaming Media 

Streaming is a technique for transferring multimedia data such that it can be 
processed as a steady and continuous stream. Streaming technologies are 
becoming increasingly important with the growth of the Internet because most 
users do not have fast enough access to download large multimedia files quickly. 
With streaming, the client browser or plug-in can start displaying the data before 
the entire file has been transmitted. 

For streaming to work, the client side receiving the data must be able to 
collect the data and send it as a steady stream to the application that is processing 
the data and converting it to sound or pictures. This means that if the streaming 
client receives the data more quickly than required, it needs to save the excess data 
in a buffer. If the data doesn't come quickly enough, however, the presentation of 
the data will not be smooth. 

Within the context of an audio and/or visual presentation, "media" and 
"multimedia" are used interchangeably herein. Media refers to the presentation of 
text, graphics, video, animation, and/or sound in an integrated way. 

"Streaming media" is an audio and/or visual presentation that is transmitted 
over a network (such as the Internet) to an end-user. Such transmission is 
performed so that the presentation is relatively smooth and not jerky. Long pauses 
while additional frames are being downloaded to the user are annoying to the user. 
These annoyances encourage a user to avoid viewing future streaming media. 
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Smoothly Transmitting Streaming Media 



Since the bandwidth determines the rate at which the client will receive 
data, a streaming media presentation may only be presented at a rate no greater 
than what the bandwidth allows. For example, assume media server 20 needs to 
send data at 50Kbps to the client 90 in order to smoothly "play" a streaming media 
presentation. However, the bandwidth between the client and server is only 
30Kbps. The result is a jerky and jumpy media presentation. 

In an effort to alleviate this problem, streaming media presentations are 
often encoded into multiple formats with differing degrees of qualities. 

The formats with the lowest quality (e.g., small size, low resolution, small 
color palette) have the least amount of data to push to the client over a given time. 
Therefore, a client over a slow link can smoothly present the streaming media 
presentation, but the quality of the presentation suffers. 

The formats with the highest quality (e.g., full screen size, high resolution, 
large color palette) have the greatest amount of data to push to the client over a 
given time. Therefore, the client with a fast link can smoothly present the 
streaming media presentation and still provide a high quality presentation. 

Select-a-Bandwidth Approach 

When a server sends streaming media to a client, it needs to know what 
format to use. Thus, in order to select the proper format, the server must to know 
the bandwidth between the server and the client. 

This easiest way to accomplish this is to ask the user of the client what their 
bandwidth is. Since a client's link to the Internet is typically the bandwidth 
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bottleneck, knowing the bandwidth of this link typically indicates the actual 
bandwidth. 

Fig. 2 shows a cut-away 100 of a Web page displayed on a client's 
computer. Inside the cut-away 100, is a typical user-interface 110 that may be 
used to ask a user what their connection speed is. The user clicks on one of the 
three buttons 112, 114, and 116 provided by the user-interface 110. If the user 
clicks on button 112, the server delivers data from a file containing streaming 
media in a format designed for transmission at 28.8 Kbps. Likewise, if the user 
clicks on button 114, data sends from a file containing streaming media in a format 
designed for transmission at 56.6 Kbps. If the user clicks on button 114, the server 
delivers data from a file containing streaming media in a format designed for 
transmission at a rate greater than 56.6Kbps and up-to the typical speed of a Tl 
connection. 

However, the primary problem with the "select-a-bandwidth" approach is 
that it requires a thoughtful selection by a user. This approach invites selection 
errors. 

It requires that a user care, understand, and have knowledge of her 
connection speed. Often, a user does not pay particular attention to which button 
to press. The user may only know that a media presentation will appear if the user 
presses one of these buttons. Therefore, they press any one of them. 

Often, a user does not understand the concept of bandwidth. A user may 
choose button 116 because she may want to see the presentation at its highest 
quality. This user does not realize that seeing the presentation at its highest quality 
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may result in a non-smooth presentation because her Internet connection cannot 
handle the rate that the data is being sent through it. 

If she does understand the concept of bandwidth, then the user may not 
know her bandwidth. A user may simply be ignorant of her bandwidth. In 
addition, varying degrees of noise may cause varying connection speeds each time 
a user connects to the Internet. Furthermore, some types of connections (such as a 
cable modem) can have wide degrees of connection speed depending upon 
numerous factors. 

Moreover, the user needs to understand the implications of an incorrect 
choice. A user needs to be educated so that she understands that she needs to select 
an option that is equal to or less than her bandwidth to get a smooth presentation. 
But she should not choose one that is significantly less than her bandwidth. If she 
does, then she will be seeing a smooth presentation at a lower quality that she 
could otherwise see at a higher available bandwidth. 

As can be seen by the above discussion, this manual approach is often 
confusing and intimidating to many user. Therefore, it often results in incorrect 
selections. 

What ! s more, maintaining multiple files (one for each bandwidth) at the 
media server adds to the overhead of maintaining a Web site. 

Automatic Bandwidth Detection 

To overcome these problems, media servers may use a single file 
containing subfiles for multiple bandwidths. In addition, media servers may 
automatically detect the bandwidth. 
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This single file is called a MBR (multiple bit rate) file. The MBR files 
typically include multiple differing "bands" or "streams." These bands may be 
called "subfiles." A user only clicks on one link. Automatically, behind the 
scenes, the server determines the right speed band to send to the client. 

This automatic speed detection may take a long time. This means that an 
additional five seconds to a minute (or more) is added to the user's wait for the 
presentation to begin. This delay for existing automatic speed detection is because 
of long "handshaking" times while the speed determination is going on. 

One existing automatic detection technique involves sending multiple data 
packets for measuring the speed between the server and client. This technique is 
described further below in the section titled, "Multiple Measurement Packets 
Technique." 

Bandwidth Measurement Packets 

Typically, automatic bandwidth detection techniques measure bandwidth 
between entities on a network by sending one or more packets of a known size. 

Fig. 3 shows a time graph tracking the transmission of two such packets (P x 
and P y ) between a sender (e.g., server) and a receiver (e.g., client). The server and 
client sides are labeled so. On the graph, time advanced downwardly. 

Time t a indicates the time at the server the transmission of P x begins. Time 
t b indicates the time at the server the transmission of P x ends. Similarly, Time t 0 
indicates the time at the client begins receiving P x . Time t] indicates the time at 
the client completes reception of P x . At t l5 the network hardware presumably 
passes the packet up the communication layers to the application layer. 
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Packet P y is similarly labeled on the time graph of Fig. 3. t c is the server 
time at the transmission of P y begins. t d is the server time that the transmission of 
P y ends. Similarly, t 2 is the client time that it begins receiving P y . t 3 is the client 
time that it completes reception of P y . At t 3 , the network hardware presumably 
passes the packet up the communication layers to the application layer. 

Bandwidth measurement using a single packet . In a controlled, laboratory- 
like environment, measuring bandwidth between two entities on a network is 
straightforward. To make such a calculation, send a packet of a known size from 
one entity to the other and measure the transmission latency, which is the amount 
of time it takes a packet to travel from source to destination. Given this scenario, 
one must know the time that the packet was sent and the time that the packet 
arrived. 

This technique is nearly completely impractical outside of the laboratory 
setting. It cannot be used in an asynchronous network (like the Internet) because it 
requires synchronization between the client and server. Both must be using the 
same clock. 

Alternatively, the client may track the time it begins receiving a packet 
(such as t 0 for P x ) and the time the packet is completely received (such as ti for 

Px). 

Fig. 3 shows packet P x being sent from a server to a client. P x has a known 
size in bits of PS. The formula for calculating bandwidth (bw) is 
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PS 

bw(P x ) = 

ti -to 

Formula 1 (Single Packet) 

This technique works in theory, but unfortunately does not work in practice. 
Only the hardware knows when a packet is initially received. Therefore, only the 
hardware knows when t 0 is. 

The other communication layers (such as the transport layer and the 
application layer) can only discover the time when the packet is completely 
received by the hardware. That is when the hardware passes it up to them. This 
completion time for packet P x is t h It is not possible to calculate bandwidth only 
one knowing one point in time. 

Packet-pair . A technique called packet-pair is used to overcome these 
problems in asynchronous networks. With packet-pair, two identical packets are 
sent back-to-back. The server sends a pair of packets, one immediately after the 
other. Both packets are identical; thus, they have the same size (PS). The 
bandwidth is determined by dividing the packet size by the time difference in 
reception of each packet. 

Each packet has specific measurable characteristics . In particular, these 
characteristics include its packet size (PS) and the measured time such a packet 
arrives (e.g., t 0 _3 in Fig. 3). Some characteristics (such as packet size) may be 
specified rather than measured, but they may be measured if so desired. 

As shown in Fig. 3, the server sends packet, P x . The client's hardware 
begins receiving the packet at t 0 . When reception of the packet is complete at tj, 
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the hardware passes it up the communication layers. Ultimately, it is received by 
the destination layer (e.g., application layer) at presumably t^ 

After the server sends P x (which completed at t b ), it immediately sends 
packet P y at t c . It is important that there be either 1) absolutely no measurable 
delay between t b and t c or 2) a delay of a known length between t b and t c . Herein, 
to simplify the description, it will be assumed that there is no measurable delay 
between t b and t c . 

The client's hardware begins receiving P y at t 2 . When reception of the 
packet is complete at t 3? the hardware passes it up the communication layers. 
Ultimately, it is received by the destination layer (e.g., application layer) at 
presumably t 3 . 

Fig. 3 shows no delay between t] (the time of completion of reception of P x ) 
and t 2 (the time reception of P y begins). Theoretically, this will always be the case 
if P x and P y are transmitted under identical conditions. In practice, is the often the 
case because P y is sent immediately after P x . 

Using packet-pair, the formula for calculating bandwidth (bw) is 



PS 

bw(P x P y ) = 

ts-ti 

Formula 2 (Packet-Pair) 

This technique works in theory and in practice. However, it only works 
well over a network that is relatively static. 

For example, in Fig. 1, assume the network consists of only the server 20; 
routers 32, 34, and 36; a specific ISP of ISPs 80; and client 90. Further, assume 
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that the links between each node on this static network is fixed and has a 
consistent bandwidth. In this situation, the packet-pair techniques provide an 
accurate and effective measurement of bandwidth. 

Issues related to using Packet-pair over the Internet . However, the packet- 
pair technique does not work well over a dynamic network, like the Internet. A 
dynamic network is one where there is a possibility that a packet may be handled 
in a manner different from an earlier packet or different from a later packet. In 
particular, there are problems with a TCP network. 

Fig. 1 illustrates examples of handling differences found on a dynamic 
network. Assume that all packets are traveling from the server to the client (from 
left to right in Fig. 1). Assume that packets 60-68 were sent back-to-back by the 
server 20 to the client 90. 

Notice, as illustrated in Fig. 1, that packets may take different routes. In 
addition, some routes may significantly delay the packet transmission. This is 
especially true if the packet is transmitted via an apparently unusual (but not 
necessarily uncommon) route, such as wireless transmission, oversees via an 
underwater cable, satellite transmission (as shown by dishes 46 and 50 and 
satellite 48), etc. A router (such as router 42) may delay one or more packets 
(such as 63 and 64) more than another may by temporarily storing them in a 
memory (such as buffer 43). 
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Multiple Measurement Packets Technique 

To overcome these problems, conventional automatic bandwidth 
measurement techniques uses multiple packets. A server sends several (much 
more than two) packets and calculates the speed of each. Conventional wisdom on 
bandwidth measurement indicates that in order to get accurate measurements 
several pairs of packets must be sent repeatedly over several seconds to several 
minutes. Herein, this technique is called "multiple-packets" to distinguish it from 
the above-described "packet-pair" technique. 

Typically, the ultimate bandwidth is determined by finding the average of 
the many bandwidth measurements. This averaging smoothes out variances in 
delays for each packet; however, it does not compensate for packet compression 
during transmission. One of two extremely incorrect measurements will skew the 
average. 

Unfortunately, this technique takes a long time relative the existing wait for 
the user between click and media presentation. A long time may be five seconds 
to several minutes depending on the data and the situation. Such a delay adds to 
the annoyance factor for the user who wishes experience the media presentation. 
This is not an acceptable delay. Since there are no other options available using 
conventional techniques, the user has be forced to endure these delays. 

No existing automatic bandwidth measurement can nearly instantaneously 
measure bandwidth across the Internet using a pair of packets. No existing 
automatic bandwidth measurement can make such measurements at the 
application layer. Thus, it avoids modifying the operating system. No existing 
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automatic bandwidth measurement addresses measurement distortion caused by 
packet compression. 

Transport Layer Implementation 

The conventional approaches typically modify the kernel of the operating 
system (OS) to do perform automatic bandwidth measurements. More specifically, 
these approaches modify the transport layer of the OSI model and such layer is 
often located within the kernel of the OS. In general, such modifications are 
undesirable because it is generally less stable and more expensive than 
implementations that do not modify the OS. 

If these approaches could be implemented within an application (thus, at 
the application layer), such modifications would not be possible. However, no 
existing packet-pair approach measures bandwidth at the application layer. This is 
because the application layer has less control over the details of the actual 
communication over the network. In particular, an application has even less 
control using TCP, than it would with UDP (User Datagram Protocol). 

TCP and UDP are discussed below in section titled "TCP and UDP." The 
transport and application layers are part of the seven layers of the OSI model 
discussed below. 

TCP and UDP 

Over the Internet (and other networks), packets of data are usually sent via 
TCP or UDP protocols. TCP is the universally accepted and understood across the 
Internet. 
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TCP (Transmission Control Protocol) is one of the main protocols in 
TCP/IP networks (such as the Internet). Whereas the IP protocol deals only with 
packets, TCP enables two hosts to establish a connection and exchange streams of 
data. TCP guarantees delivery of data and guarantees that packets will be 
delivered in the same order in which they were sent. 

UDP (User Datagram Protocol) is a connectionless protocol that (like 
TCP) runs on top of IP networks. Unlike TCP/IP, UDP/IP provides very few error 
recovery services, offering instead a direct way to send and receive packets (i.e., 
datagram) over an IP network. 

A packet is a chunk of data provided by the application program. UDP 
typically sends a single "application-level packet" as a single UDP packet. 
However, TCP may break a single application-level packet into multiple smaller 
TCP "segments", each of which is treated as a separate "packet" at the TCP layer. 
The Nagle Algorithm (discussed below) does the opposite: It takes multiple small 
application packets and combines them into a single larger TCP segment. 

Nagle TCP/IP Algorithm 

The Nagle Algorithm was designed to avoid problems with small TCP 
segments (sometimes called "tinygrams") on slow networks. The algorithm says 
that a TCP/IP connection can have only one outstanding tinygram that has not yet 
been acknowledged. The defined size of a tinygram depends upon the 
implementation. However, it is generally a size smaller than the size of typical 
TCP segments. 
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The Nagle Algorithm states that under some circumstances, there will be a 
waiting period of about 200 milliseconds (msec) before data is sent. The Nagle 
Algorithm uses the following parameters for traffic over a switch: 

• Segment size = MTU or tcp_mssdflt or MTU path discovery value. 

• TCP Window size = smaller of tcp_sendspace and tcp_recvspace 
values. 

• Data size = application data buffer size. 

The following are the specific rules used by the Nagle Algorithm in 
deciding when to send data: 

• If a packet is equal to or larger than the segment size (or MTU), and 
the TCP window is not full, send an MTU size buffer immediately. 

• If the interface is idle, or the TCP_NODELAY flag is set, and the 
TCP window is not full, send the buffer immediately. 

• If there is less than half of the TCP window in outstanding data, send 
the buffer immediately. 

• If sending less than a segment size buffer, and if more than half the 
window is outstanding, and TCP_NODELAY is not set, wait up to 
200 msec for more data before sending the buffer. 

Setting TCP_NODELAY on the socket of the sending side deactivates the 
Nagle Algorithm. All data sent will go immediately, no matter what the data size. 

The Nagle Algorithm may be generically called the "tinygram-buffering" 
function because it buffers tinygrams. 
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TCP Slow Start Algorithm 

On TCP networks that don't use "slow start/' devices start a connection 
with a sender by injecting multiple packets into the network, up to the window 
size advertised by a receiver. While this is acceptable when the two hosts are on 
the same LAN (local area network), problems may arise if there are routers and 
slower links between the sender and the receiver. Since some of the intermediate 
router is likely to queue the packets, it is possible for that such a router will have 
insufficient memory to queue them. Therefore, this naive approach is likely to 
reduce the throughput of a TCP connection drastically. 

The algorithm to avoid this is called "slow start." It operates by observing 
that the rate at which new packets should be injected into the network is the rate at 
which the acknowledgments are returned by the other end. 

The Slow Start Algorithm adds another window to the sendees TCP: a 
congestion window, called "cwnd". When a new connection is established with a 
host on another network, the congestion window is initialized to one packet. Each 
time an acknowledgement (i.e., "ACK") is received, the congestion window is 
increased by one packet. The sender can transmit up to the minimum of the 
"congestion window" and the "advertised window." The "congestion window" is 
flow control imposed by the sender. The "advertised window" is flow control 
imposed by the receiver. The former is based on the sender's assessment of 
perceived network congestion. The latter is related to the amount of available 
buffer space at the receiver for this connection. 

The sender starts by transmitting one packet and waiting for its ACK 
(acknowledgement). When that ACK is received, the congestion window is 
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incremented from one to two. Now, two packets can be sent. When each of those 
two packets is acknowledged, the congestion window is increased to four. And so 
forth. 

At some point, the capacity of the connection between the sender and 
receiver may be reached. At that point, some intermediate router will start 
discarding packets. This tells the sender that its congestion window has reached 
its limit. 

Proxy 

A proxy (i.e., proxy server) is a device that sits between a client application 
(such as a Web browser) and a real server. Generally, it intercepts all requests to 
and from the real server to see if it can fulfill the requests itself. If not, it forwards 
the request to the real server. A proxy is employed for two main purposes: 
Improve performance and filter requests. 

Since the proxy server is often a central point of communication for a 
number of clients, it attempts to make its communications as efficient as possible. 
Thus, it typically implements a form of the Nagle Algorithm. Every new TCP 
connection start with Slow Start. When there is a proxy between the client and the 
server, slow start is run in the two connections: server-proxy and proxy-client. 
Therefore, the proxy adds new complexity to the packet pair experiment. 
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Background Summary 

An application (at the application layer) has limited control over the 
handling of TCP packets. Thus, conventional bandwidth measurements avoid 
application-level TCP bandwidth measurements. 

The integrity of the packet pair technique requires that at least two packets 
be sent back-to-back. However, these packets may not arrive in such a manner 
because of the affects of the Nagle Algorithm and the Slow Start Algorithm. This 
discourages the use of the packet-pair technique for bandwidth measurement over 
a TCP network. 

SUMMARY 

The fast dynamic measurement of bandwidth in a TCP network 
environment utilizes a single pair of packets to calculate bandwidth between two 
entities on a network (such as the Internet). This calculation is based upon the 
packet-pair technique. This bandwidth measurement is extremely quick. 

On its journey across a network, communication devices may delay the 
packet pairs. In particular, TCP networks have two algorithms designed to delay 
some packets with the goal of increasing the overall throughput of the network. 
However, these algorithms effectively delay a packet pair designed to measure 
bandwidth. Therefore, they distort the measurement. These algorithms are 
"Nagle" and "Slow Start." 

The fast dynamic measurement of bandwidth implements countermeasures 
to overcome the delays imposed by these algorithms. Such countermeasures 
include disabling the application of the Nagle Algorithm; minimizing the buffering 
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of packets by sending a "push" packet right after the packet pair; and avoiding the 
Slow Start Algorithm by priming it with a dummy packet. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates a typical public networking environment (such as the 
Internet) and the routing of and delay of data packets sent from a server to a client. 

Fig. 2 is cut-away portion of a Web page. The cut-away shows a user 
interface providing a user a mechanism for selecting the bandwidth. This shows a 
conventional technique for determining bandwidth. 

Fig. 3 shows a packet pair (being sent from a server to a client) graphed in 
the time domain. This shows a conventional implementation of packet-pair 
technique to measure bandwidth. 

Fig. 4 also illustrates a typical public networking environment (such as the 
Internet). This shows a pair of packets sent back to back. 

Fig. 5 is a flowchart illustrating the methodology of an implementation of 
the exemplary bandwidth meter. 

Figs. 5a, 5b, and 5c are a flowchart illustrating the specific methodology 
implementation details of different aspects of the exemplary bandwidth meter. 

Fig. 6 is an example of a computing operating environment capable of 
implementing the exemplary bandwidth meter. 

DETAILED DESCRIPTION 

The following description sets forth a specific embodiment of the fast 
dynamic measurement of bandwidth in a TCP network environment that 
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incorporates elements recited in the appended claims. This embodiment is 
described with specificity in order to meet statutory written description, 
enablement, and best-mode requirements. However, the description itself is not 
intended to limit the scope of this patent. Rather, the inventors have contemplated 
that the claimed fast dynamic measurement of bandwidth in a TCP network 
environment might also be embodied in other ways, in conjunction with other 
present or future technologies. 

Even when used with a TCP network (such as the Internet), an exemplary 
fast dynamic measurement of bandwidth in a TCP network environment (i.e., 
"bandwidth meter" or "bw-meter") described herein is fast and robust. The 
exemplary bandwidth meter implements a low-latency technique for automatically 
measuring the network bandwidth available between two entities on a 
communications network. It has been found to be particularly useful over the 
Internet (or other such TCP networks). 

Unlike the conventional approaches, the exemplary bw-meter obtains a best 
effort bandwidth measurement with the least possible delay, even under difficult 
network conditions. The exemplary bw-meter is designed to provide reasonable 
output in less than one second in most existing TCP networks, including LANs, 
cable, DSL, and modem connections. 

Furthermore, the exemplary bw-meter is implemented at the application 
layer. Although the exemplary bw-meter may be implemented on other layers, the 
one described herein is implemented on the application layer. In particular, it may 
be partially implemented by a Web browser or a media player. 



Lee & Hayes, PLLC 



21 



0808001 US MS1-565US.PAT.APP.DOC 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



Other aspects of the packet-pair technique that may be implemented by the 
exemplary bw-meter are described in more detail in co-pending patent application, 
entitled "Fast Dynamic Measurement of Connection Bandwidth" with serial 

number , which was filed , 2000 and is assigned to the 

Microsoft Corporation. The co-pending application is incorporated herein by 
reference. 

Packet-Pair Technique 

The exemplary bw-meter utilizes the well-established packet-pair technique 
described above and illustrated in Fig. 3. The exemplary bw-meter uses the 
packet-pair formula (Formula 2) described above to calculate the maximum de 
facto bandwidth between two entities on a communications network (such as the 
Internet). 

Unlike existing automatic bandwidth measurement techniques that use 
multiple packets, the exemplary bw-meter uses a single pair of packets for 
measuring bandwidth over the Internet. With the exemplary bw-meter, bandwidth 
measurements and calculations are made "nearly instantaneously" because only a 
single pair of measurement packets is sent. The term "nearly instantaneously" 
means that the bandwidth is determined as soon as the pair of packets arrive at the 
client. 

The exemplary bw-meter overcomes the drawbacks and limitations of using 
packet-pairs over a TCP network (such as the Internet) by implementing 
countermeasures to the Nagle Algorithm and the Slow Start Algorithm. 
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Packet Pair Journey , A packet of the packet-pair technique of the 
exemplary bw-meter travels from the sending entity (e.g., server) to the receiving 
entity (e.g., client). Fig. 4 shows an example of such a journey. Fig. 4 illustrates 
an environment similar to that shown in Fig. 1 . 

Fig. 4 illustrates an example of a typical Internet (TCP network) 
configuration. It includes a server (such as media server 220), which is coupled to 
the Internet 230. The server typically includes one or more physical server 
computers 222 with one or more physical storage devices and/or databases 224. 
On the other side of an Internet transmission is a client 290, which is connected 
via a proxy server 284, which is Internet Service Providers (ISPs) 280. 

Cloud 230 is labeled the Internet, but it is understood that this cloud 
represents that portion of the Internet that only includes that which is illustrated 
therein. Inside such cloud are the routers, transmission lines, connections, and 
other devices that more-often-than-not successfully transmit data between clients 
and servers. Inside exemplary Internet cloud 230 are routers 232-244; two 
satellite dishes 246 and 250; and a satellite 248. These represent the possible 
paths that a data packet may take on its way between the server and the client. 

Fig. 4 shows successive packets 260 and 262 of the pair sent in accordance 
with the exemplary bw-meter. The server 220 sends packet 262 immediately after 
packet 260. 

The proxy server 284 is connected via link 282 to its ISPs 280. Client 290 
and clients 292 use the proxy server to communicate with the Internet. 
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Application-Level Bandwidth Measurement 



Conventional bandwidth measurement approaches are typically 
implemented the transport layer or some other layer below the application level. 
However, the exemplary bw-meter is implemented at the application layer. There 
are at least two major benefits to such an application-level approach to TCP 
packet-pair bandwidth measurement. 

First, a lower level (such as transport level) packet-pair implementation is 
disfavored. It requires changes to the kernel of the OS and it does not lend itself 
easily to incremental deployment. As opposed to an application-level 
implementation, a lower packet-pair implementation involves greater expense in 
development, initial deployment, future development, and future deployment. 

Second, according to one study, only a quarter of the TCP connections 
studied would benefit from a bandwidth measurement. Therefore, it is not cost 
effective to implement such bandwidth measurement at a lower level if it only 
used no more than a quarter of the connections. Therefore, such bandwidth 
measurement is best to be included in the applications that applications that need 
it. Applications are much easier (and less expensive) to incrementally deploy than 
a new kernel of the operating system. 

It is generally safe to assume that the receiver's clock is sufficiently precise 
and the IP datagrams (i.e., packets) are passed up through the receiver's network 
stack (OSI layers) to the application unmolested. The second assumption is 
violated in the case of some applications, such as software from America Online 
version four (AOLv4) and earlier which behaved as if it only delivered data to the 
application on a timer. Thus, this imposed an artificial clock granularity on the 
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measurements. Fortunately, it appears that version five (and later) of AOL 
software do not perform such molestation. 

Conditions for Effective Measurements Using Packet-Pair 

When using the packet-pair technique to measure bandwidth, two 
conditions must be met to achieve a good measurement. 

The first condition is that the packets must be sent back-to-back. Herein, 
this is called the "back-to-back" condition. If packets are not back-to-back, then 
the timing measurements between them will be distorted. Both the Nagle 
Algorithm and the Slow Start Algorithm threaten this condition. Both potentially 
delay delivery of the second measurement packet. For bandwidth measurement 
using a packet-pair, any avoidable delay between packets is intolerable because it 
distorts the measurement of the actual bandwidth. 

The second condition is that the size of the packets must be preserved. 
That is, they must not be combined with other packets. Herein, this is called the 
"size-preservation" condition. The Nagle Algorithm threatens this condition. 

The Nagle algorithm may cause multiple application-layer packets to be 
sent as a single TCP segment. Thus, while the application may think it is sending 
two or more packets, in fact, the TCP layer is only sending a single packet. 

Countermeasures to the Nagle Algorithm 

An interesting behavior of the Nagle Algorithm is that for small packets, 
only one ACK may be outstanding. Thus, a pair of small packets cannot be sent 
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back-to-back with the Nagle Algorithm. The Nagle Algorithm will combine small 
packets that are waiting for an ACK. This affects both the "back-to-back" and the 
"size-preservation" conditions. 

The exemplary bw-meter puts a countermeasure into action to overcome 
the Nagle Algorithm's tendency to interfere with the two conditions. An entity 
(such as the server 220 in Fig. 4) sends a command that instructs communication 
devices (such as routers 230-250) to disable the Nagle Algorithm. Generally, the 
server passes a command generically called a "delay-disable" command. 
Specifically, the server passes TCPNODELAY to SetSockOpt(). 

As long as the congestion window is open, turning off the Nagle Algorithm 
prevents TCP from attempting to combine any of the packet-pair packets and TCP 
will immediately write the packet to the network. 

In other words, with the Nagle Algorithm disabled by a "delay-disable" 
command, both packets of the packet-pair will flow though routers without the 
delay caused by Nagle's collecting of multiple packets. 

Countermeasure to the Slow Start Algorithm 

The exemplary bw-meter puts a countermeasure into action to overcome 
the Slow Start Algorithm's tendency to interfere with the "back-to-back" 
condition. This is done by opening the server's congestion window (which is 
specifically called "cwnd") to at least three packets. 

This is done by "priming" the congestion window. To prime the congestion 
window, a server sends at least one packet and receives an ACK before it sends the 
pair of packets of the packet-pair. Therefore, the server sends at least one 
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"priming" packet to the client and that packet is not used for calculating 
bandwidth. After one or more priming packets are sent, the server sends the actual 
packet-pair used for measuring bandwidth. At this point, the Slow Start Algorithm 
will let, at least, two packets in a row go through without delaying them. 

The Slow Start Algorithm can be completely avoided by performing the 
bandwidth measurement later in the particular TCP connection. However, this is 
not a desirable option because of two reasons: additional delay and overhead 
causing a faulty measurement. 

If the measurement is made later, there is a built-in delay to wait for the 
Slow Start Algorithm to run its course. It is better to not have any delays that can 
be avoided. With the exemplary bw-meter, this delay can be avoided. 

Performing the bandwidth measurement at the beginning of a TCP 
connection removes many uncertainties that accumulate as the connection 
progresses. For example, if the TCP connection is shared by both control and data 
transport, it is impossible to predict later in the session whether the sender's 
congestion window will allow packets to be sent back-to-back. 

Countermeasures to Delays at a Proxy 

The Nagle Algorithm operating at a proxy can similarly distort a packet- 
pair bandwidth measurement. Generally, proxies do not recognize a "delay- 
disable" command. Neither the client nor the server application can tell in advance 
if the connection is made through a circuit-level proxy. 

In order to address the Nagle Algorithm at a proxy, a large third packet is 
sent after the pair of measurement packets. If the proxy is holding the second 
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packet of the packet-pair, the third packet pushes it along. Hence, this third packet 
is called the "push" packet. 

In addition, the first and second packets could be combined at the proxy. 
The result would be an artificially high measurement, but the overwhelming 
majority of proxy users have a high bandwidth connection anyway. 

Methodological Implementation 

Fig. 5 shows a methodological implementation of the exemplary bandwidth 
meter. It is from the server perspective. At 300, the dynamic bandwidth 
measurement in accordance with the exemplary bandwidth meter is initiated. 
Typically, a user of the client selects an option on a Web page to experience a 
media presentation. Alternatively, an application on the client may initiate such 
bandwidth measurement. Such an application may be a Web browser, media 
player, or the like. 

Generally, at 302 of Fig. 5, the server sends a pair of packets to the client, 
with one immediately following the other. The specific implementation details at 
this block 302 are shown in Figs. 5a, 5b, and 5c. These figures are discussed 
below. 

At 306, the server waits for a response from the client. If it is not received 
within time limit, the process returns to send another pair of packets at 302. 
Although not shown in the flowchart, the process will repeat this a given number 
of times before terminating and generating an error. If a response is received 
within the time limit, the process proceeds to the next block at 308. 
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The response includes a bandwidth measurement determined by the client 
using the pair of packets sent by the server at 304. The server extracts the specified 
bandwidth from the response at 308. 

At 310 of Fig. 6, the server selects the file (or portion thereof) formatted for 
a bandwidth equal to or just lesser than the specified bandwidth. At 312, the server 
sends the file (or portion thereof) to the client. 

If it was a media file, the user of the client enjoys a media presentation that 
begins play quickly. It also plays smoothly and at the highest quality possible at a 
measured bandwidth. The process ends at 3 14. 

Countermeasure to Nagle Algorithm . Fig. 5a shows the specific 
methodological implementation of the exemplary bandwidth meter for the 
countermeasure to the Nagle Algorithm. At 402, the server sends a delay-disable 
command to disable the use of the Nagle Algorithm. At 404, the server sends a 
pair of bandwidth-measurement packets to the client. At 406, the process returns 
to block 306 of Fig. 5. 

Countermeasure to Proxy Delays . Fig. 5b shows the specific 
methodological implementation of the exemplary bandwidth meter for the 
countermeasure to the Proxy delays. At 412, the server sends a pair of bandwidth- 
measurement packets to the client. At 414, the server sends a "push" packet to 
force the pair out of any buffer in which they may be stored by a communications 
device. At 416, the process returns to block 306 of Fig. 5. 

Countermeasure to Slow Start Algorithm . Fig. 5c shows the specific 
methodological implementation of the exemplary bandwidth meter for the 
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countermeasure to the Slow Start Algorithm. At 422, the server sends a "priming" 
packet to overcome the Slow Start Algorithm. 

This "priming" packet is not used for bandwidth measurement. It allows 
the network to open up (i.e., the congestion window to open) and allow two 
packets at a time without delay. At 424, the server sends a pair of bandwidth- 
measurement packets to the client. At 426, the process returns to block 306 of Fig. 
5. 

Other Implementation Details 

Implementation Applications . The exemplary bw-meter may be 
implemented by any entity wishing to quickly measure bandwidth between two 
entities on a network. In particular, a TCP network, such as the Internet. 

Such an entity may implement this exemplary bw-meter at the application 
level. Examples of an application-level program modules that may implement this 
exemplary bw-meter is streaming media server application on a server using either 
Microsoft Media Server (MMS) protocol or Real Time Streaming Protocol 
(RTSP). 

Both MMS and RTSP share the very similar fundamental techniques to 
provide the conditions for a successful measurement using the exemplary bw- 
meter. However, implementation of the exemplary bw-meter using RTSP is 
trickier than such an implementation using MMS protocol. 

RTSP Packet Pair Syntax . One way that RTSP is trickier than MMS is 
because the three packets must masquerade as a response to an RTSP command so 
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the client's RTSP parser may process them. The RTSP GET PARAMETER 
command is used to request the packet pair experiment. The first packet of the 
reply begins with the typical RTSP response headers. 

Here are examples of the headers for a packet-pair request from the client: 



G ET_PARAM ETE R * RTSP/1.0 
Content-Type: application/x-rtsp-packetpair 
Content-Length: 16 

Date: Sun, 02 Apr 2000 22:36:18 GMT 
CSeq: 2 

User-Agent: WMPIayer/5.0.0.0488 guid/1A21De80-08E7-11D4-93FE- 
006097B76A2E 

Accept-Language: en-us, *;q=0.1 
Accept-Charset: UTF-8, *;q=0.1 
Timestamp: 1 



Here are examples of the headers for the packet pair reply from the server: 



RTSP/1.0 200 OK 

Content-Type: application/x-rtsp-packetpair 
Content-Length: 2048 
Date: Sun, 02 Apr 2000 22:30:48 GMT 
CSeq: 2 

TimeStamp: 1 0.063 

Server: WMServer/5.0.0.0518 



TCP issues . As noted earlier, the congestion window needs to be open to at 
least three packets by the time the three packets are sent from the server. Since the 
initial congestion window is two, the DESCRIBE response is used to open the 
window to three or greater. If the DESCRIBE response requires three packets, 
that means that the third packet must wait for an ACK from the client before it can 
be transmitted. 
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While the server's TCP is waiting for the ACK of either or both of the first 
two packets, if the GET_PARAMETER arrives and then the application starts 
writing the reply to the GET PARAMETER to the socket, the packet pair packets 
may get combined with the third and last packet of the DESCRIBE reply and with 
one another. Therefore, the client should not send the GET_PARAMETER until 
the DESCRIBE reply is fully received. 

This guarantees that the congestion window will be open at the server when 
the packet pair packets are sent. Consequently, no packets will be combined. The 
DESCRIBE response may be one or greater packets and the congestion window 
will be three or greater when the packet pair is performed. Obviously, no other 
traffic should occur before the packet pair. 

Measuring Arrival Times . Part of performing the packet pair measurement 
of the exemplary bw-meter (at the application level) means that the client 
application is measuring the arrival times of the two packets. RTSP presents an 
extra challenge in that the response headers take a relatively long time to process 
compared to the granularity needed for an accurate measurement. Therefore, the 
client cannot wait until processing the response header to figure out that it is a 
response to a packet pair request before it time stamps this first packet of the 
packet pair. 

The timestamp must occur before the client even knows what kind of 
response it is. Therefore, when the client makes a packet pair request, it 
timestamps every incoming command response packet until it receives the packet 
pair. Then it quits this pre-timestamp mode. 
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The client must still process the header of the first packet before it can read 
the second packet. Therefore, there is an upper bound to how high of a bottleneck 
can be measured and it is determined by how fast the client can process the RTSP 
response header. For instance, if the time it takes to process the header is 5ms, the 
maximum speed that can be measured is around 800 kb/s. Therefore, RTSP 
measurements at the high end will not be as good as MMS unless the time it takes 
to parse the RTSP response is low. 

Exemplary Computing Environment 

Fig. 6 illustrates an example of a suitable computing environment 920 on 
which the exemplary bw-meter may be implemented. 

Exemplary computing environment 920 is only one example of a suitable 
computing environment and is not intended to suggest any limitation as to the 
scope of use or functionality of the exemplary bw-meter. Neither should the 
computing environment 920 be interpreted as having any dependency or 
requirement relating to any one or combination of components illustrated in the 
exemplary computing environment 920. 

The exemplary bw-meter is operational with numerous other general 
purpose or special purpose computing system environments or configurations. 
Examples of well known computing systems, environments, and/or configurations 
that may be suitable for use with the exemplary bw-meter include, but are not 
limited to, personal computers, server computers, thin clients, thick clients, hand- 
held or laptop devices, multiprocessor systems, microprocessor-based systems, set 
top boxes, programmable consumer electronics, wireless phone, wireless 
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communication devices, network PCs, minicomputers, mainframe computers, 
distributed computing environments that include any of the above systems or 
devices, and the like. 

The exemplary bw-meter may be described in the general context of 
computer-executable instructions, such as program modules, being executed by a 
computer. Generally, program modules include routines, programs, objects, 
components, data structures, etc. that perform particular tasks or implement 
particular abstract data types. The exemplary bw-meter may also be practiced in 
distributed computing environments where tasks are performed by remote 
processing devices that are linked through a communications network. In a 
distributed computing environment, program modules may be located in both local 
and remote computer storage media including memory storage devices. 

As shown in Fig. 6, the computing environment 920 includes a general- 
purpose computing device in the form of a computer 930. The components of 
computer 920 may include, by are not limited to, one or more processors or 
processing units 932, a system memory 934, and a bus 936 that couples various 
system components including the system memory 934 to the processor 932. 

Bus 936 represents one or more of any of several types of bus structures, 
including a memory bus or memory controller, a peripheral bus, an accelerated 
graphics port, and a processor or local bus using any of a variety of bus 
architectures. By way of example, and not limitation, such architectures include 
Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MCA) 
bus, Enhanced ISA (EISA) bus, Video Electronics Standards Association (VESA) 
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local bus, and Peripheral Component Interconnects (PCI) buss also known as 
Mezzanine bus. 

Computer 930 typically includes a variety of computer readable media. 
Such media may be any available media that is accessible by computer 930, and it 
includes both volatile and non-volatile media, removable and non-removable 
media. 

In Fig. 6, the system memory includes computer readable media in the form 
of volatile, such as random access memory (RAM) 940, and/or non-volatile 
memory, such as read only memory (ROM) 938. A basic input/output system 
(BIOS) 942, containing the basic routines that help to transfer information 
between elements within computer 930, such as during start-up, is stored in ROM 
938. RAM 940 typically contains data and/or program modules that are 
immediately accessible to and/or presently be operated on by processor 932. 

Computer 930 may further include other removable/non-removable, 
volatile/non-volatile computer storage media. By way of example only, Fig. 6 
illustrates a hard disk drive 944 for reading from and writing to a non-removable, 
non-volatile magnetic media (not shown and typically called a "hard drive"), a 
magnetic disk drive 946 for reading from and writing to a removable, non-volatile 
magnetic disk 948 (e.g., a "floppy disk"), and an optical disk drive 950 for reading 
from or writing to a removable, non-volatile optical disk 952 such as a CD-ROM, 
DVD-ROM or other optical media. The hard disk drive 944, magnetic disk drive 
946, and optical disk drive 950 are each connected to bus 936 by one or more 
interfaces 954. 
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The drives and their associated computer-readable media provide 
nonvolatile storage of computer readable instructions, data structures, program 
modules, and other data for computer 930. Although the exemplary environment 
described herein employs a hard disk, a removable magnetic disk 948 and a 
removable optical disk 952, it should be appreciated by those skilled in the art that 
other types of computer readable media which can store data that is accessible by a 
computer, such as magnetic cassettes, flash memory cards, digital video disks, 
random access memories (RAMs), read only memories (ROM), and the like, may 
also be used in the exemplary operating environment. 

A number of program modules may be stored on the hard disk, magnetic 
disk 948, optical disk 952, ROM 938, or RAM 940, including, byway of example, 
and not limitation, an operating system 958, one or more application programs 
960, other program modules 962, and program data 964. 

A user may enter commands and information into computer 930 through 
input devices such as keyboard 966 and pointing device 968 (such as a "mouse"). 
Other input devices (not shown) may include a microphone, joystick, game pad, 
satellite dish, serial port, scanner, or the like. These and other input devices are 
connected to the processing unit 932 through an user input interface 970 that is 
coupled to bus 936, but may be connected by other interface and bus structures, 
such as a parallel port, game port, or a universal serial bus (USB). 

A monitor 972 or other type of display device is also connected to bus 936 
via an interface, such as a video adapter 974. In addition to the monitor, personal 
computers typically include other peripheral output devices (not shown), such as 
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speakers and printers, which may be connected through output peripheral interface 
975. 

Computer 930 may operate in a networked environment using logical 
connections to one or more remote computers, such as a remote computer 982. 
Remote computer 982 may include many or all of the elements and features 
described herein relative to computer 930. 

Logical connections shown in Fig. 6 are a local area network (LAN) 977 
and a general wide area network (WAN) 979. Such networking environments are 
commonplace in offices, enterprise-wide computer networks, intranets, and the 
Internet. 

When used in a LAN networking environment, the computer 930 is 
connected to LAN 977 network interface or adapter 986. When used in a WAN 
networking environment, the computer typically includes a modem 978 or other 
means for establishing communications over the WAN 979. The modem 978, 
which may be internal or external, may be connected to the system bus 936 via the 
user input interface 970, or other appropriate mechanism. 

Depicted in Fig. 6, is a specific implementation of a WAN via the Internet. 
Over the Internet, computer 930 typically includes a modem 978 or other means 
for establishing communications over the Internet 980. Modem 978, which may 
be internal or external, is connected to bus 936 via interface 970. 

In a networked environment, program modules depicted relative to the 
personal computer 930, or portions thereof, may be stored in a remote memory 
storage device. By way of example, and not limitation, Fig. 6 illustrates remote 
application programs 989 as residing on a memory device of remote computer 
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982. It will be appreciated that the network connections shown and described are 
exemplary and other means of establishing a communications link between the 
computers may be used. 

Exemplary Operating Environment 

Fig. 6 illustrates an example of a suitable operating environment 920 in 
which the exemplary bw-meter may be implemented. Specifically, the exemplary 
bw-meter is implemented by any program 960-962 or operating system 958 in Fig. 
6. 

The operating environment is only an example of a suitable operating 
environment and is not intended to suggest any limitation as to the scope of use of 
functionality of the bw-meter described herein. Other well known computing 
systems, environments, and/or configurations that may be suitable for use with the 
bw-meter include, but are not limited to, personal computers, server computers, 
hand-held or laptop devices, multiprocessor systems, microprocessor-based 
systems, programmable consumer electronics, network PCs, minicomputers, 
mainframe computers, distributed computing environments that include any of the 
above systems or devices, and the like. 

Computer-Executable Instructions 

An implementation of the exemplary bw-meter may be described in the 
general context of computer-executable instructions, such as program modules, 
executed by one or more computers or other devices. Generally, program modules 
include routines, programs, objects, components, data structures, etc. that perform 
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particular tasks or implement particular abstract data types. Typically, the 
functionality of the program modules may be combined or distributed as desired in 
various embodiments. 

Computer Readable Media 

An implementation of the exemplary bw-meter may be stored on or 
transmitted across some form of computer readable media. Computer readable 
media can be any available media that can be accessed by a computer. By way of 
example, and not limitation, computer readable media may comprise computer 
storage media and communications media. 

Computer storage media include volatile and non-volatile, removable and 
non-removable media implemented in any method or technology for storage of 
information such as computer readable instructions, data structures, program 
modules, or other data. Computer storage media includes, but is not limited to, 
RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, 
digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic 
tape, magnetic disk storage or other magnetic storage devices, or any other 
medium which can be used to store the desired information and which can be 
accessed by a computer. 

Communication media typically embodies computer readable instructions, 
data structures, program modules, or other data in a modulated data signal such as 
carrier wave or other transport mechanism and included any information delivery 
media. The term "modulated data signal" means a signal that has one or more of 
its characteristics set or changed in such a manner as to encode information in the 
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signal. By way of example, and not limitation, communication media includes 
wired media such as a wired network or direct-wired connection, and wireless 
media such as acoustic, RF, infrared, and other wireless media. Combinations of 
any of the above are also included within the scope of computer readable media. 

Conclusion 

Although the fast dynamic measurement of bandwidth in a TCP network 
environment has been described in language specific to structural features and/or 
methodological steps, it is to be understood that the fast dynamic measurement of 
bandwidth in a TCP network environment defined in the appended claims is not 
necessarily limited to the specific features or steps described. Rather, the specific 
features and steps are disclosed as preferred forms of implementing the claimed 
fast dynamic measurement of bandwidth in a TCP network environment. 
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CLAIMS: 

1. A method for facilitating speedy communication of packets between 
entities on a network, the method comprising: 

sending a delay-disable command; 

sending a set of packets from a sending entity to a receiving entity. 

2. A method as recited in claim 1, wherein the set of packets includes 
two packets sent back-to-back. 

3. A method as recited in claim 1, wherein the set of packets consists of 
a first bandwidth-measurement packet and a second bandwidth-measurement 
packet, wherein the second packet is sent immediately after the first packet. 

4. A method as recited in claim 1, wherein the network is TCP. 

5. A method as recited in claim 1 , wherein the delay-disable command 
disables the Nagle Algorithm on one or more communication devices on the 
network. 



Lee & Hayes, PLLC 



41 



080800114$ MS1-565US.PA T.APP.DOC 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



6. A method as recited in claim 1, wherein the delay-disable command 
is TCP_NODELAY. 

7. A program module having computer-executable instructions that, 
when executed by a computer, performs the method as recited in claim 1 at an 
application layer in accordance with an OSI model. 

8. A computer-readable medium having computer-executable 
instructions that, when executed by a computer, performs the method as recited in 
claim 1 . 

9. A method for facilitating speedy communication of packets between 
entities on a network, the method comprising: 

sending a set of packets from a sending entity to a receiving entity, wherein 
a transmission delay between packets in the set is intolerable; 

immediately thereafter, sending at least one "push" packet to avert a 
transmission delay between packets in the set, wherein the delay is caused by 
packet buffering of a communication device on the network. 

10. A method as recited in claim 9, wherein the set of packets includes 
two packets sent back-to-back. 
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11. A method as recited in claim 9, wherein the set of packets are 
bandwidth-measurement packets for measuring bandwidth between the sending 
entity and the receiving entity. 

12. A method as recited in claim 9, wherein the communication device 
is a proxy server. 

13. A method as recited in claim 9, wherein the network is TCP. 

14. A program module having computer-executable instructions that, 
when executed by a computer, performs the method as recited in claim 9 at an 
application layer in accordance with an OSI model. 

15. A computer-readable medium having computer-executable 
instructions that, when executed by a computer, performs the method as recited in 
claim 9. 

16. A method for facilitating speedy communication of packets between 
entities on a network, the method comprising: 

sending a set of packets from a sending entity to a receiving entity, wherein 
a transmission delay between packets in the set is intolerable; 

immediately before, sending at least one "priming" packet to avoid a 
transmission delay between packets in the set, wherein the delay is caused by 
flow-control functions of a communication device on the network. 
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17. A method as recited in claim 16, wherein the set of packets includes 
two packets sent back-to-back. 

18. A method as recited in claim 16, wherein the set of packets are 
bandwidth-measurement packets for measuring bandwidth between the sending 
entity and the receiving entity. 

19. A method as recited in claim 16, wherein the network is TCP. 

20. A method as recited in claim 16 further comprising establishing a 
TCP connection between the sending entity to the receiving entity, wherein the 
establishing is just before the sending of the set of packets. 

21. A method as recited in claim 16, wherein the flow-control function 
is the Slow Start Algorithm. 

22. A program module having computer-executable instructions that, 
when executed by a computer, performs the method as recited in claim 16 at an 
application layer in accordance with an OSI model. 
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23. A computer-readable medium having computer-executable 
instructions that, when executed by a computer, performs the method as recited in 
claim 16. 

24. A method for facilitating bandwidth measurement between two 
entities on a network, the method comprising: 

sending a delay-disable command; 

sending a pair of bandwidth-measurement packets from a sending entity to 
a receiving entity. 

25. A method as recited in claim 24 further comprising receiving a 
bandwidth calculation based upon measurements related to the pair of packets. 

26. A method for facilitating bandwidth measurement between two 
entities on a network, the method comprising: 

sending a pair of bandwidth-measurement packets from a sending entity to 
a receiving entity, wherein a transmission delay between packets in the pair is 
intolerable; 

immediately thereafter, sending at least one "push" packet to avert a 
transmission delay between packets in the pair, wherein the delay is caused by 
packet buffering of a communication device on the network. 
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27. A method as recited in claim 26 further comprising receiving a 
bandwidth calculation based upon measurements related to the pair of packets. 

28. A method for facilitating bandwidth measurement between two 
entities on a network, the method comprising: 

sending a pair of bandwidth-measurement packets from a sending entity to 
a receiving entity, wherein a transmission delay between packets in the pair is 
intolerable; 

immediately before, sending at least one "priming" packet to avoid a 
transmission delay between packets in the pair, wherein the delay is caused by 
flow-control functions of a communication device on the network. 

29. A method as recited in claim 28 further comprising receiving a 
bandwidth calculation based upon measurements related to the pair of packets. 

30. A computer-readable medium having computer-executable 
instructions that, when executed by a computer, perform a method to facilitate 
speedy communication of packets between entities on a network, the method 
comprising: 

sending a delay-disable command; 

sending a set of packets from a sending entity to a receiving entity. 
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31. A computer-readable medium having computer-executable 
instructions that, when executed by a computer, perform a method to facilitate 
speedy communication of packets between entities on a network, the method 
comprising: 

sending a set of packets from a sending entity to a receiving entity, wherein 
a transmission delay between packets in the set is intolerable; 

immediately thereafter, sending at least one "push" packet to avert a 
transmission delay between packets in the set, wherein the delay is caused by 
packet buffering of a communication device on the network. 

32. A computer-readable medium having computer-executable 
instructions that, when executed by a computer, perform a method to facilitate 
speedy communication of packets between entities on a network, the method 
comprising: 

sending a set of packets from a sending entity to a receiving entity, wherein 
a transmission delay between packets in the set is intolerable; 

immediately before, sending at least one "priming" packet to avoid a 
transmission delay between packets in the set, wherein the delay is caused by 
flow-control functions of a communication device on the network. 
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33. An apparatus comprising; 
a processor; 

a transmission-delay avoider executable on the processor to: 
send a delay-disable command; 

send a set of packets from a sending entity to a receiving entity. 

34. An apparatus comprising: 
a processor; 

a transmission-delay avoider executable on the processor to: 

send a set of packets from a sending entity to a receiving entity, 

wherein a transmission delay between packets in the set is intolerable; 

immediately thereafter, send at least one "push" packet to avert a 

transmission delay between packets in the set, wherein the delay is caused 

by packet buffering of a communication device on the network. 

35. An apparatus comprising: 
a processor; 

a transmission-delay avoider executable on the processor to: 

send a set of packets from a sending entity to a receiving entity, 

wherein a transmission delay between packets in the set is intolerable; 

immediately before, send at least one "priming" packet to avoid a 

transmission delay between packets in the set, wherein the delay is caused 

by flow-control functions of a communication device on the network. 
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36. A modulated data signal having data fields encoded thereon 
transmitted over a communications channel, comprising: 

a first field including a delay-disable command 

a second field including a first bandwidth-measurement packet; 

a third field including a second bandwidth-measurement packet. 

37. A modulated data signal having data fields encoded thereon 
transmitted over a communications channel, comprising: 

a first field including a first bandwidth-measurement packet 
a second field including a second bandwidth-measurement packet; 
a third field including a "push" packet facilitating minimization of 
transmission delay between the first and second packets, wherein the delay is 
caused by packet buffering of a communication device on the network. 

38. A modulated data signal having data fields encoded thereon 
transmitted over a communications channel, comprising: 

a first field including a "priming" packet; 

a second field including a first bandwidth-measurement packet; 

a third field including a second bandwidth-measurement packet; 

wherein the "priming" packet facilitates minimization of transmission delay 
between packets, wherein the delay is caused by flow-control functions of a 
communication device on the network. 
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ABSTRACT 

The fast dynamic measurement of bandwidth in a TCP network 
environment utilizes a single pair of packets to calculate bandwidth between two 
entities on a network (such as the Internet). This calculation is based upon the 
packet-pair technique. This bandwidth measurement is extremely quick. On its 
journey across a network, communication devices may delay the packet pairs. In 
particular, TCP networks have two algorithms designed to delay some packets 
with the goal of increasing the overall throughput of the network. However, these 
algorithms effectively delay a packet pair designed to measure bandwidth. 
Therefore, they distort the measurement. These algorithms are Nagle and Slow 
Start. The fast dynamic measurement of bandwidth implements countermeasures 
to overcome the delays imposed by these algorithms. Such countermeasures 
include disabling the application of the Nagle Algorithm; minimizing the buffering 
of packets by sending a "push" packet right after the packet pair; and avoiding the 
Slow Start Algorithm by priming it with a dummy packet. 
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